Towards Contactless Silent Speech Recognition Based on Detection of Active and Visible Articulators Using IR-UWB Radar
Authors
Abstract
People with hearing or speaking disabilities are deprived of the benefits of conventional speech recognition technology because it relies on acoustic signals. Recent research has therefore focused on silent speech recognition systems that are based on the motions of a speaker's vocal tract and articulators. Most such systems use either contact sensors, which are very inconvenient for users, or optical systems, which are susceptible to environmental interference, so a contactless and robust solution is required. Toward this objective, this paper presents a series of signal processing algorithms for a contactless silent speech recognition system using an impulse radio ultra-wideband (IR-UWB) radar. The IR-UWB radar is used to detect motions of the lips and jaw remotely and wirelessly. We propose a feature extraction algorithm that extracts the necessary features of lip and jaw motions from the received radar signals. In our word recognition test with five speakers, the proposed algorithm noticeably improved speech recognition performance compared with the existing algorithm. We also propose a speech activity detection algorithm that automatically selects speech segments from continuous input signals, so that speech recognition processing is performed only when speech segments are detected. Our testbed consists of commercial off-the-shelf radar products, and the proposed algorithms are readily applicable without designing specialized radar hardware for silent speech processing.
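The abstract does not say how speech activity detection is carried out, so the following is only a generic sketch of the idea of selecting active segments from a continuous stream. It is not the paper's algorithm. It assumes the radar output is available as a 2-D array of frames (slow time by range bins), that the first few frames contain no motion, and that a simple energy threshold is adequate. The function name `detect_activity` and all of its parameters are hypothetical.

```python
import numpy as np


def detect_activity(frames, win=5, k=3.0, noise_frames=20):
    """Return (start, end) index pairs of segments with above-threshold motion energy.

    frames: array of shape (n_frames, n_range_bins); a hypothetical layout,
            not taken from the paper.
    Indices refer to frame-to-frame differences; `end` is exclusive.
    """
    frames = np.asarray(frames, dtype=float)

    # Differencing consecutive frames suppresses reflections from static objects.
    diff = np.diff(frames, axis=0)

    # Motion energy per time step, summed over all range bins.
    energy = np.sum(diff ** 2, axis=1)

    # Moving average to smooth short dips inside an active segment.
    kernel = np.ones(win) / win
    smooth = np.convolve(energy, kernel, mode="same")

    # Threshold estimated from the leading frames, assumed to contain no motion.
    base = smooth[:noise_frames]
    thresh = base.mean() + k * base.std()
    active = smooth > thresh

    # Turn the boolean mask into (start, end) pairs via its rising/falling edges.
    padded = np.concatenate(([False], active, [False]))
    edges = np.flatnonzero(padded[1:] != padded[:-1])
    return [(int(s), int(e)) for s, e in zip(edges[0::2], edges[1::2])]
```

In a pipeline of this shape, only the frames inside each returned segment would be passed on to feature extraction and recognition, which matches the abstract's statement that recognition runs only when speech segments are detected.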
Similar resources
A Matched Filtering Technique for Noninvasive Monitoring of Human Respiration Using IR-UWB Radar
Impulse Radio Ultra Wideband (IR-UWB) signals are excellent candidates for use in radars for detection and monitoring of small-amplitude movements, such as human breathing, in the presence of obstacles. This paper reports a method for noninvasive monitoring of human respiration using IR-UWB radar that is based on the detection of periodic variation in the time of arrival of the reflected pulse f...
Through-wall human being detection using UWB impulse radar
Ultra-wideband (UWB) impulse radar plays an important role in contactless vital sign (VS) detection. The VS can be extracted remotely by acquiring the oscillations of the human chest. Unfortunately, because of the low signal-to-noise ratio (SNR), it is usually challenging to identify VS using only the traditional fast Fourier transform (FFT), especially in complicated conditions. To extract VS acc...
Integration of Face and Voice Recognition
Cepstral features and features based on a bio-mechanical model of the visible articulators will be the identity-carrying characteristics extracted from acoustic speech and visual speech respectively. Speakers will be modelled by multi-layer perceptrons trained as discriminative models or, alternatively, as predictive models. In the discriminative modelling scheme, each speaker model will be tra...
Learning Spoken Words via the Ears and Eyes: Evidence from 30-Month-Old Children
From the very first moments of their lives, infants are able to link specific movements of the visual articulators to auditory speech signals. However, recent evidence indicates that infants focus primarily on auditory speech signals when learning new words. Here, we ask whether 30-month-old children are able to learn new words based solely on visible speech information, and whether information...
Evaluation of a silent speech interface based on magnetic sensing
This paper reports on isolated word recognition experiments using a novel silent speech interface. The interface consists of magnetic pellets that are fixed to relevant speech articulators, and a set of magnetic field sensors that measure changes in the overall magnetic field created by these pellets during speech. The reported experiments demonstrate the effectiveness of this technique and show...